Stochastic sensing of polynucleotides using patterned nanopores 
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The effect of the microscopic structure of a pore on polymer translocation is studied using Langevin 
dynamics simulation, and the consequence of introducing patterned stickiness inside the pore is 
investigated. It is found that the translocation process is extremely sensitive to the detailed structure 
of such patterns with faster than exponential dependence of translocation times on the stickiness 
of the pore. The stochastic nature of the translocation process leads to discernable differences 
between how polymers with different sequences go through specifically patterned pores. This notion 
is utilized to propose a stochastic sensing protocol for polynucleotides, and it is demonstrated that 
the method, which would be significantly faster than the existing methods, could be made arbitrarily 
robust. 

PACS numbers: 87.15.A-,87.16.Uv,36.20.Ey 
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I. INTRODUCTION 

The quest for efficient sequencing of single stranded 
DNA using synthetic nanopores has recently led to the 
development of a plethora of novel theoretical and ex- 
perimental design ideas that use a variety of different 
approaches [mD]- Experiments have demonstrated that 
the current blockade readout from single stranded DNA 
and RNA molecules that are electrophoretically driven 
through biological and synthetic nanopores could in prin- 

f'e reflect a signature of the underlying sequence fl- 
it is now possible to design solid-state nanopores 
[l6j with tailored surface properties that could regu- 
late DNA-pore surface interaction (l7l - fl9| and also reduce 
noise [20l . l2l| . A number of recent experiments have been 
successful in discriminating between polynucleotides (22j 
and identifying single nucleotides }23l425| . However, more 
remains to be done to resolve issues involving stability, 
sensitivity, and resolution, before they can be integrated 
into fast and efficient devices for sequencing purposes 

MM. 



Theoretical studies of polymer translocation through 
nanopores [28l - l5l| have revealed that the process is intrin- 
sically stochastic and features a rather wide distribution 
for the translocation time. The inherent noise acts as an 
overwhelming source of error for the sequence detection 
strategies that are based on deterministic patterns in the 
translocation time readout, unless the process is suffi- 
ciently slowed down such that time-averaging eliminates 
the noise @, [22j. In other words, achieving robustness 
in sequencing using deterministic strategies has intrinsic 
limitations, and might require significant compromise in 
translocation speed Q . 

Here, we propose a strategy to control the transloca- 
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tion time and its statistics by using pores that have pat- 
terned surface energetics. We then address the question 
of whether it is possible to engineer distinct stochastic 
features for the translocation of heteropolymers with any 
given sequence through different pores, such that the sta- 
tistical readout from combined translocation events of a 
particular sample through a collection of different pores 
could quickly and accurately reveal its sequence by syner- 
gistic exclusion of unlikely sequences. We start by study- 
ing the translocation of a homopolymer that is driven 
from the cis (entrance) to the trans (exit) side of a narrow 
pore by a uniform external field, F (Fig. [T] sec Appendix 
A). We vary the stickiness of the pore (characterized by 
the attractive strength, e pm ) along its length and con- 
sider three different examples (Fig. [lji-c). A uniformly 
attractive pore, Pore a, serves as the control case. Pore 
P is structured to have an attractive entrance and exit 
separated by a repulsive core, while Pore 7 is designed to 
have an attractive entrance and a repulsive exit. These 
apparently minor changes in the pore patterning turn out 
to have significant effects on the translocation times. 



II. TRANSLOCATION TIME DISTRIBUTIONS 
FOR HOMOPOLYMERS 

The translocation time (r) is divided into (i) the initial 
filling time (ti): the time taken by the first monomer of 
the polymer to reach the exit without returning to the 
pore, (ii) the transfer time (ra): the time taken from 
the exit of the first monomer into the trans side to the 
entry of the last monomer from the cis side, and (iii) 
the escape time (T3): the time between entry of the last 
monomer in the pore and its escape to the trans side; see 
Fig. QJi-c. These definitions arc better characterized by 
counting the number of monomers of the polymer on the 
cis side, N C - 1S , inside the pore, A pore , and on the trans 
side, At rans , as functions of time (Fig. [T]i), with N = 
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pore 



Atrans (see Supplementary Movie 1). 
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For Pores a and p, the filling time, t\, depends weakly 
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FIG. 1: The schematics of the polymer and nanopore models. Simulation snapshots showing the translocation of a homopolymer 
and heteropolymers with different block lengths across the three patterned pores at various stages of the translocation process, 
namely, filling, transfer and escape, (a) A homopolymer (yellow) translocating from the cis to the trans side of Pore a. The 
interaction of the pore monomers (red) with the polymer has an attractive well whereas the monomers that make up the walls 
of the pore (blue) have an excluded volume interaction with the polymer. The pore width is fixed and there is a constant force 
driving the polymer that acts inside the pore, (b) A heteropolymer (poly(dAdC)ie) of block length M = 2 with alternating 
bases A (yellow) and C (green) translocating across Pore j3. Pore j3 consists of two sticky monomers on either end of the 
pore that are separated by a wall monomer. The bases A and C have different interactions with the sticky monomers, (c) A 
heteropolymer (poly(dA4dC4)s) of block length M = 8 translocating across Pore 7, which has two sticky monomers on the cis 
side and three repulsive monomers on the trans side that result in an attractive entrance and a repulsive exit. (Inset) Shows 
the pore-polymer potentials, (d) A trace of the monomer count at the trans end, middle, and cis end of the pore as functions 
of time for a homopolymer translocating through Pore 7, with e pm = 3 and F — 0.5. Ti, T2 and T3 change dramatically when 
pore patterning is introduced. 



on the stickiness of the pore (Fig. [2J1). In the presence of 
the weak driving force, Pore /3 has a shallower potential 
well near the cis end which reduces trapping time making 
filling easier (see Appendix B). The barrier encountered 
near the core is small enough to be overcome by the fluc- 
tuations of the polymer. With increasing e pm , the effect 
of trapping becomes more dominant and thus the differ- 
ence in T\ between pores a and /3 increases. Pore 7, which 
has a repulsive exit, takes a relatively longer time to fill. 
The large potential barrier beyond the cis side slows down 
the polymer increasingly more as the entrance becomes 
stickier (with increasing e pm ). The distribution of filling 
times shows a relatively longer exponential tail for Pore 
7 due to this potential barrier. In sharp contrast to the 
filling time, the transfer time T2 has a much more regu- 
lar behavior with increasing stickiness of the pores (Fig. 
[2^). The transfer of the polymer over the length of the 
pore depends on the potential landscape inside the pore: 
Pore a, which is attractive throughout, has the longest 
transfer time, while Pore 7, which is the least attrac- 
tive pore, has the shortest r 2 . Figure [2^ shows that the 
transfer time depends exponentially on e pm , and the dif- 
ference in scales across the three pores is consistent with 
the number of attractive beads inside each pore. The es- 
cape time, T3, depends strongly on the pore interaction 
near the exit, and differs most dramatically across the 
three pores (Fig. [5f). In this time interval, the polymer 
is already inside the pore and to escape the pore it needs 
to overcome the potential barrier near the exit. The de- 
pendence of the escape time on e pm turns out to be faster 



than exponential, which suggests that seemingly insignif- 
icant changes in the stickiness pattern and strength of the 
wall of the pore could modify the translocation time by 
several orders of magnitude. 

The average total translocation time for the homopoly- 
mer across all the pores is plotted in Fig. [2g, which shows 
that for the relatively weak external force used here the 
translocation process is controlled by the escape mode 
(see Fig. HJi-g). The overall translocation time distri- 
butions for the three pores are also very different (Fig. 
Oi-c) , despite the fact that the general shape of the distri- 
butions for each mode of the translocation process were 
similar. The extreme sensitivity of the translocation dy- 
namics of the homopolymer on the pore patterning and 
stickiness suggests that it might be possible to engineer 
pores such that heteropolymers of any given sequence 
will have distinct statistical features that could be used 
for stochastic sequence detection. 



III. HETEROPOLYMER SEQUENCE SENSING 

To examine the feasibility of this sequencing strategy, 
we replace the homopolymer with heteropolymers con- 
structed in accordance with earlier experimental [l2j and 
theoretical [34| studies of polynucleotide translocation 
through nanopores; those containing symmetric purine- 
pyrimidine blocks of the form A n C n , with the block 
length M — 2n (Fig. [TJ sec Appendix A). We assign dif- 
ferent values to the attractive interactions of the sticky 
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FIG. 2: Translocation time statistics for homopolymers. (a-c) Comparison of translocation time distributions for the three 
patterned pores, for F — 0.5 and e pm = 2.0. The filling, transfer and escape distributions are similar across the three pores, 
but have distinctly different scales (e.g. average and variance), such that the overall translocation time distribution for the 
three pores are discernably different, (d-f) Comparison of average filling, transfer and escape times for the three different pores 
for F = 0.5 as a function of e pm . While the filling time shows only a moderate dependence on the stickiness, and the transfer 
time exhibits an exponential dependence on e pm , the dependence of the the escape time is even faster than exponential, (g) 
The total translocation time for the three pores as a function of e pm indicates that for small forces, the escape time dominates 
the translocation process. The orders of magnitude differences in the translocation times between different pores shows the 
extraordinary sensitivity of the translocation dynamics on pore patterning. 



beads in the pore with base A (e p a) and base C (e p c), 
with e p A > e pC . 

The translocation time distributions for five different 
sequences are shown in Fig. [3£i-b for Pores (3 and 7. We 
find that the different modes of translocation across the 
two pores respond differently to variations in the block 
length, such that the outcome for the total transloca- 
tion time exhibits distinct features (see Supplementary 
Movies 2 and 3 and Appendix C). To simplify the pic- 
ture, we summarize the distributions for each pore in a 
scatter plot by using only the two basic characteristics of 
mean and standard deviation (Fig. |4ji-d). We observe a 
number of interesting features. For example, both mean 
time and standard deviation seem to roughly increase 
with block length for Pore /3, whereas for Pore 7 mean 



time initially increases with block length, peaks at n = 4 
and goes back to smaller values for longer blocks. While 
Pore /? cannot easily distinguish between (^464)4 and 
(A 2 C2)s, Pore 7 can, and the reverse is true for (^464)4 
and {A$C%)2- We have also examined the effect of the 
orientation of the heteropolymer when it enters the pore, 
and considered polymers of total length N = 32 and 
N = 64 (Fig. m,-d). The differences in the scatter im- 
mediately suggests that a combined translocation time 
measurement across the two pores and comparison with 
the statistics of the known sequences could help identify 
an unknown sequence to a high accuracy. 

To demonstrate this idea and probe its statistical ro- 
bustness as a sequencing strategy, we run a test on 
a model sequencing device that would be made up of 
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FIG. 3: Translocation time statistics for heteropolymers. Comparison of filling, transfer, escape and translocation time distri- 
butions for Pores j3 (a) and 7 (b) and five different sequences of the heteropolymer. The distributions correspond to F = 1.0 
for Pore /3, and F = 0.5 for Pore 7, respectively. 



multiple copies of Pores /? and 7 that are arranged in- 
series, such that the readout from translocation of a given 
polynucleotide with an unknown sequence through all of 
them can be independently recorded. We calculate the 
average and standard deviation of the translocation times 
through the Pores (3 and 7, separately, using their cor- 
responding multiple readouts. Using the difference be- 
tween the measured means and standard deviations and 
the tabulated values for known sequences through each 
pore, we calculate the relative error for each sequence 
and minimize it for all sequences across both pores to 
find the closest match, which will be returned as the pre- 
dicted sequence. The ratio of the number of successful 
sequence detection events and the total number of at- 
tempts, which is defined as the accuracy of the statistical 
sequence detection algorithm, turns out to be remarkably 
high (Fig. Hk). For N = 32 and fixed orientation of the 
polynucleotide for all pores, the accuracy starts off at 
75% with just the minimum two copies of each pore and 
rises quickly to above 95% when there are ten copies of 
each pore. 

Instead of just using the first two moments, we can 
choose to use the full translocation time distributions for 
the sequence detection, using the following method. If 
we make a measurement of the translocation time (r) of 
a polymer with an unknown sequence through a given 
pore (Pore (3, say), then the probability of the time 
being part of a distribution of a known sequence (say 
n) is Vn(r), where V n is the known probability distri- 
bution. After m measurements, the likelihood of the 
translocation times being part of a given distribution 



can be defined as C n = n™ =1 'P^(Ti). The structure of 
the unknown heteropolymer is determined by finding the 
n with the maximum likelihood, seq = scq[max{£„}]. 
For multiple pores and fixed orientation of the polymer 
through all of them, the likelihood can be generalized to 
£ l ° = n™ 1 ^(T l )n™ 1 ^(r J ).... Figure IB shows the re- 
sulting accuracy plots as obtained using the full translo- 
cation time distributions, which exhibit a considerably 
faster convergence in the algorithm. The corresponding 
results are very similar for N = 64 (Fig. 0J:). This shows 
that an inherently statistical DNA sequencing strategy 
could be designed to have an arbitrary accuracy. 

During the sequence detection process the heteropoly- 
mer could enter the pore with either base A or base 
C entering first. Therefore it is imperative to consider 
orientation effects on the translocation time distribu- 
tions, as seen in Fig. @Jt-d, and hence on our sequenc- 
ing strategy. Due to the possibility of orientation flips 
during multiple readouts of the unknown sequence, we 
need to consider all permutations of the two orienta- 
tions in a given set of readouts. When we incorporate 
the possibility of different orientation in the transloca- 
tion time measurements, then we would need to con- 
sider the sum of all possible permutations of orienta- 
tions in determining the likelihood of the translocation 
times being part of a given distribution. This leads 



to CI 



for the 



random orientation case. Again, we count the number of 
successful sequence detection events and plot the accu- 
racy of this detection algorithm, as shown in Fig@J. The 



5 



Pore B N=32 




(b) 

1600 



Pore 3 iV=64 



1 


1 I 


e 




e * 

32 8 






16 




32 
» 

4) 

, » 


2 

2 




O 


1 1 





Pore 7 JV=64 
r 




) 1000 1500 2000 2500 1000 2000 3000 4000 5000 
mean translocation time mean translocation time 

( e ) Fixed Orientation 



5000 10000 15000 
mean translocation time 



10000 20000 30000 40000 
mean translocation time 



Random Orientation 




"T" 



-X 4-- 



— A — Moment A r -32- 
— ■— Moment jv=64. 
— * — Full jV-32 



_l_ 



_l_ 



II AT— 64 
J l_ 



3 4 5 6 7 8 
samples per pore type 




3 4 5 6 7 
samples per pore type 



FIG. 4: Using translocation time statistics to detect polynucleotide sequences, (a-d) Scatter plots showing the distinctive 
mean and standard deviations of the different sequences and for different ends of the polynucleotide entering the pore. The 
mean translocation time, (r), and its standard deviation, \J (r 2 ) — (r) 2 , for the different sequences are calculated from the 
distributions. The scatter plots reveal the distinctive characteristics of the translocation events for the different sequences 
through each pore. The plots correspond to (a) Pore /3 with F — 1.0 and N = 32 (b) Pore j3 with F — 1.0 and N — 64 (c) Pore 
7 with F — 0.5 and N — 32 and (d) Pore 7 with F — 0.5 and N = 64. (e-f) Accuracy of sequence detection using multiple 
joint translocation events through Pores j3 and 7. The plots are constructed by recording a given number of translocation times 
through Pores /3 and 7, and using a comparison with either the full distribution or the first two moments of the distribution 
shown in the scatter plots (a-d). For the method that uses the moments, the sample average and standard deviation are 
calculated and used to find the relative error of the sample average and standard deviation compared to the known values and 
compound them into an error metric for each pore and each sequence. The error metric is subsequently used for each pore to 
define a closeness metric, which will be minimized to predict the sequence. The accuracy is the ratio between the number of 
successful predictions and the total number of attempts. In (e) the orientation of the polymer is known and preserved when 
it passes through the different pores. In (f) the orientation of the polymer is not known and randomly changes when it passes 
through the different pores. 



result of this calculation shows that using the full dis- 
tribution is surprisingly robust with respect to the ran- 
domization of the orientation, which is of paramount im- 
portance in practice. For the set of measurements which 
do not involve orientation effects we observe a distinctly 
faster detection of a sequence (95% when there are 4 
copies of each pore) as compared to the detection using 
the scatter plots (Fig. |4^). With the orientations of the 
polymer as it enters the pore taken into account, the ac- 
curacy of detection rises to 95% with just 5 copies of 
each pore (Fig. 



polymer-pore interactions and combining readouts from 
multiple pores for rapid convergence. The desired pat- 
terns in surface interaction could be achieved by u sing bi- 
ological nanopores with appropriate modification 53,l54| 
or those with known hydrophobic-hydrophilic pattern 
structure [55j , as well as solid-state nanopores with tailor- 
made surface interactions [l5l - tl9| . The proposed ap- 
proach could potentially improve the overall speed of se- 
quence detection by orders of magnitude, and could be 
integrated in high throughput microfiuidic devices. 



IV. CONCLUSION 



In contrast to the generally accepted notion of sup- 
pressing the stochastic element of polynucleotide mo- 
tion through nanopores to achieve efficient DNA sequenc- 
ing, we propose to extract information from the statisti- 
cal fluctuations towards sequence detection. Our strat- 
egy is based on designing distinguishable translocation 
time statistics for any given sequence by engineering the 



Acknowledgments 



We would like to thank G. Battaglia for fruitful discus- 
sions. This work was supported by grant EP/G062137/1 
from the EPSRC. 



6 




-4 -3 -2 -1 1 2 3 4 -4 -3 -2 -1 1 2 3 4 -4 -3 -2 -1 1 2 3 4 

XXX 



FIG. 5: Comparison of the polymer-pore interaction potentials. (Top left) A schematic of the pore. Pore monomers could 
either have an attractive (LJ) interaction (red) or a short range repulsive (rLJ) interaction (blue) with the polymer inside the 
pore. (Top right) The potential energy landscape in the center (y = 0) along the length of the channel (blue) is modified 
(green) in the presence of an external driving force F = 0.5. (Bottom) The complete potential energy landscape experienced 
by the polymer inside the pore. Blue to red represents increasing potential depth. 



Appendix A: Methods 

Homopolymer model. We model the polymer as a self 
avoiding chain by using beads and springs (Fig. [I}. The 
beads represent monomer groups of the polymer and we 
model the excluded volume interaction between a pair of 
monomers by a truncated repulsive Lennard- Jones (rLJ) 
potential of the form 



U LJ (r) 

^ mm V / 




r <r n 
r > r n 



where e is the potential depth and a is the monomer 
diameter. The cut-off distance, r min = 2 1 / 6 cr, is set at 
the potential minimum. The bonding springs between 
monomer groups are modelled by a finite extension non- 
linear elastic (FENE) potential of the form 



f r 2 



where k = If. jo 2 and R = 2a are the spring constant 
and bond length respectively. FENE potentials are con- 
venient as the bond length effectively sets the maximum 
allowed separation between monomer groups. We use 
polymers of length N = 32 and N = 64 in our simula- 
tions. 

Heteropolymer model. We model the heteropolymers 
similarly using beads and springs (Fig. [IJ with the poly- 
mer beads representing the bases A and C arranged in 
symmetric blocks A n C n . With a DNA of length N = 32, 
the minimum value of n = 1 is for poly(dAdC)i6 and the 
maximum value of n = N/2 for poly(dAigdCi6). The 
bases A and C are only distinguished by their relative 
interactions with the pore. 

Pore model. The pore and wall are constructed from 
stationary monomers separated by a distance of a from 
each other. The pore is made up of two rows of monomers 



symmetric about the coordinate system with a length 
L = 5cr and separated by a width of W = 2.25c. The 
pore width is chosen to allow only single file translocation 
of the polymer and avoid hair-pin configurations. The 
polymer translocates from the cis (entrance) end to the 
trans (exit) end of the pore (Fig. [I]). The walls of the 
pore extend in the y direction. 

Polymer-pore interaction. The interaction of the pore 
with the polymer is tuned such that the interaction varies 
along the length of the pore. This interaction could either 
be the short-range repulsive form described above or the 
standard LJ form: 



U L \r) 



4e r 



, 12 



: r <r c 
: r > r c 



with e pm denoting the potential depth and r c = 2.ba 
denoting the cut-off distance. We choose three differ- 
ent pore patterns with the patterning symmetric about 
the x-axis: (1) Pore a is an attractive pore with all the 
monomers of the pore interacting with the polymer by 
the LJ potential. (2) Pore j3 has an attractive entrance 
and exit with the first two monomers and the last two 
monomers of the pore interacting with the polymer by 
the LJ potential and the middle monomer being repul- 
sive. (3) Pore 7 has an attractive entrance (first two 
monomers attractive) and a repulsive exit (last three 
monomers repulsive). Note that in all the three cases the 
pore entrance is chosen to be attractive to successfully 
initiate translocation. The stickiness of the pore (e pm ) 
is varied during homopolymer translocation. During the 
translocation of the heteropolymer the stickiness differs 
for base A (c p a) and base C (e p c)- We fix these values 
to e p A = 3.0 and e p c = 1.0 respectively. The polymer in- 
teracts with the wall (f/^w) with the same rLJ potential 
as used for the intra-monomer excluded volume interac- 
tion. In addition the polymer experiences a driving force, 
F c = Fx directed along the pore axis with magnitude F, 
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FIG. 6: Comparison of the filling, transfer, escape and 
translocation time distributions for the three pores at F = 0.5 
and e = 1.0 



which mimics the elcctrophorctic driving of biopolymers 
through nanopores. 

Polymer injection. In our simulation we are not con- 
cerned with injection of the polymer into the pore, but 
only with the dynamics of the polymer during translo- 
cation. We initially place the first bead of the polymer 
chain at the entrance of the pore and allow the remain- 
ing beads to fluctuate. Once the polymer relaxes to its 
equilibrium configuration, the bead is released and the 
translocation of the polymer across the pore is monitored. 
The translocation time is defined as the time that elapses 
between the entrance of the first bead of the polymer and 
the exit of the last bead. All failed translocation events 
are discarded. 

Integration algorithm. The equations of motion of 
the monomers of the polymer were integrated using a 
Langevin dynamics (LP ) algorithm that includes a ve- 
locity Verlet update [52|. Within the LD formalism, the 
interaction of the monomers with a solvent is simulated 
by a viscous drag term proportional to the monomer ve- 
locity and a random force term modeled by Gaussian 
white noise with an auto-correlation function that satis- 
fies the fluctuation-dissipation theorem. The equation of 
motion for a monomer therefore takes the form: 



(Vj +Vi, 



where m is the monomer mass, Ui 

LJ 



LJ 

mm 



rrFENE 
^ch 



l^vm + Uh^ is the total potential experienced by a 
monomer, £ is the friction coefficient, is the monomer 
velocity, and r\ i is the random force with (Viitf-Vjito)) = 
4:kBT£6ij5(t — to), T being the temperature. A time step 
of At = 0.01 is used in all simulation runs. 

Reduced units. The units of energy, length, and mass 
are set by e, a, and m, respectively. These set the scale 
for the time as (ma 2 /e) 1 / 2 . Following Luo et al. [Hj], we 
assume that the size of each bead in our coarse-grained 
polymer model corresponds to the Kuhn length of a single 
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FIG. 7: Comparison of the filling, transfer, escape and 
translocation time distributions for the three pores at F = 0.5 
and e = 1.5 
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FIG. 8: Comparison of the filling, transfer, escape and 
translocation time distributions for the three pores at F = 0.5 
and e = 2.0 



stranded DNA, which is approximately three nucleotide 
bases. This sets the bead size, a ps 1.5 nm, the mass 
of the bead, m ps 936 amu (given that the mass of a 
base in DNA is ps 312 amu) and the charge of a bead, 
q ps 0.3 e (each base having a charge of 0.1 e effectively 
[56]). We set £ = 0.7 and fc^T = 1.2 to allow comparison 
with known results. Therefore, the interaction strength 
at T = 295 K is given by e = fc B T/1.2 ps 3.4 x 10" 21 J. 
This gives the time scale of (ma 2 / 'e) 1 / 2 w 30 ps and a 
force scale of e/a « 2.3 pN. Therefore an external driv- 
ing force in the range 0.5 — 1.0 corresponds to a volt- 
age range V = FL/q ps 190 — 380 mV across the pores. 
Note, however, that higher values of up to 0.5 e for the 
effective base charge have also been reported in the lit- 
erature which suggest that the appropriate voltage 
range could be lower than the above-mentioned values. 
As a rough indication of how much the patterning could 
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FIG. 9: Comparison of the filling, transfer, escape and 
translocation time distributions for the three pores at F = 1.0 
and W = 2.5 using heteropolymers with different sequences. 
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FIG. If: Comparison of the filling, transfer, escape and 
translocation time distributions for the Pores j3 and 7 at 
F — 0.5 and W = 2.5 using heteropolymers with different 
sequences. Translocation through Pore a is extremely slow 
for these range of values and are not considered for this anal- 
ysis. 
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FIG. 12: Comparison of the filling, transfer, escape and 
translocation time distributions for the Pores /3 and 7 at 
F = 0.5 and W = 2.25 using heteropolymers with differ- 
ent sequences. Translocation through Pore a is extremely 
slow for these range of values and are not considered for this 
analysis. 



FIG. 10: Comparison of the filling, transfer, escape and 
translocation time distributions for the three pores at F = 1 .0 
and W = 2.25 using heteropolymers with different sequences. 

affect translocation speed, we note the example of a ho- 
mopolymer with e pm = 2.5 (see Fig. [2g), which yields 
the translocation time of 100 /js for Pore a and 0.1 /xs for 
Pore 7. These figures are consistent with the typical Q 
observed translocation rates of a single nuclcotidc//xs. 



Appendix B: Sensitivity of translocation time 
distributions on pore-polymer interaction 

The translocation of a homopolymcr through a nar- 
row pore has been theoreti cally studied extensively in 
the past [H-im. Luo et al. [11 HHH] investigated the 
pore-polymer interaction in a uniformly attractive pore, 
which we chose as the control pore, Pore a, in our simula- 
tions. In Fig. [SJ we compare the potential landscapes for 
Pores a, /3, and 7, respectively. The potential landscapes 



seen by the polymer at the center of the pore along its 
length reveal the positions of the potential barriers for 
the three pores. Pore a has a strong barrier at the exit 
due to the stickiness of the pore. Pore /3 — that has a re- 
pulsive part in the middle — has a far lower barrier at the 
exit. It does, however, experience a small barrier just 
after the entrance, which it overcomes easily. This ex- 
plains why the filling time for Pore /3 is lower than that 
of Pore a. Pore 7, which has an attractive entrance but a 
repulsive exit, has a large barrier at the entrance, which 
makes the filling time relatively longer as compared the 
other pores. However, the escape time for the polymer 
in Pore 7 is vastly reduced due to the repulsive exit. 

In Figs. [(J [Jj and[8l we observe the change of the dis- 
tribution for the translocation times with increasing at- 
tractive strengths, e pm for the the three different pores. 
For Pore a, we note the transition from a Gaussian form 
to a long-tailed distribution with increasing attraction, 
which was observed by Luo et al. This transition is ob- 
served in all the three pores, although for Pores /3 and 7 
they happen at higher attractive strengths. 
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Appendix C: Optimizing the pore for sequencing: 
the effect of driving force and pore width 

To understand the effects of patterning the pore on se- 
quencing, we considered the translocation of heteropoly- 
mers through the pores. Following Luo et al. [34j . the 
polymers were represented as consisting of symmetric 
blocks A„C n of A and C bases, which interact differ- 
ently with the pore. The time distributions for the three 
pores show a varying degree of sensitivity on the spe- 
cific sequence of the polymer (Figs. [UJ QUI HU and IT2l. 
depending on the strength of the external force and the 
pore width. In Fig. |H1 we show the dependence of the 
distributions for F = 1.0 and W = 2.5. For Pore a, 
the difference in distributions for short block lengths is 
relatively small. As the block lengths are increased, the 
distribution changes sharply. However, for larger block 
lengths it becomes difficult again to distinguish them. 
For Pores (3 and 7, the distributions have a high degree 
of overlap and are not suitable for sequencing. 

As the width is decreased (Fig. [TOl W = 2.25), Pore 
a takes extremely long to translocate for larger block 
lengths. The potential barrier proves difficult to sur- 
mount and the polymer is stuck for long periods inside 



the pore. However, lowering the width has a positive 
impact on Pore /3 which leads to translocation time dis- 
tributions that can distinguished from one another. Al- 
though the translocation times are much longer, the dis- 
tributions are well separated by their means and standard 
deviations. This impact is much less for Pore 7. 

On the other hand, we could keep the pore width fixed 
(W = 2.5) and lower the strength of the external driving 
force (Fig. [TTj) . The effect on Pore a is drastic as the 
polymers fail to cross the potential barrier. Pore (3 and 
7, on the other hand, still translocate polymers although 
their distributions for the different sequences are far from 
distinguishable. 

Finally, we keep the width at W = 2.25 and lower 
the force to F = 0.5 (Fig. [T2")) . This completely takes 
out Pore a from consideration as the translocation times 
become prohibitively long. The translocation time scales 
are now much longer for Pores /3 and 7 as well. However, 
the mean and standard deviations for Pore 7 are again 
well separated making it easier to distinguish between 
the distributions, and hence make it a suitable candidate 
for sequencing. In our simulations, we use Pore j3 at 
F = 1.0, W = 2.25 and Pore 7 at F = 0.5, W = 2.25 as 
the two most suitable pores for our sequencer. 
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